Topic and language specific internet search engine

Authors

  • Domonkos Tikk
  • György Biró
  • Ferenc Szidarovszky
  • Zsolt Tivadar Kardkovács
  • Gábor Lemák
Abstract

In this paper we present the results of our project, which aims to build a categorization-based, topic-oriented Internet search engine. In particular, we focus on economics-related electronic materials available on the Internet in Hungarian. We present our search service, which harvests, stores, and makes searchable the publicly available content of the subject domain. The paper describes the search facilities and the structure of the implemented system, with special emphasis on intelligent search algorithms and document processing methods.

Similar resources

Categorization-based Topic-oriented Internet Search Engine

In this paper we present the result of our project that aims to build a categorization-based topic-oriented Internet search engine. Particularly, we focus on the economic related electronic materials available in Hungarian on the Internet. We present our search service that harvests, stores and makes searchable the...

Building Topic Specific Language Mo Competitive Mo

The ability to build topic specific language models, rapidly and with minimal human effort, is a critical need for fast deployment and portability of ASR across different domains. The World Wide Web (WWW) promises to be an excellent textual data resource for creating topic specific language models. In this paper we describe an iterative web crawling approach which uses a competitive set of adap...

The Core of a Topic-Specific Search Engine: How to Create It

A technique for gathering scientific, narrow topic-related documents from the Internet is presented. It has been successfully applied to compile a large Japanese collection of algorithms and their applications. Key-Words: Search Engine, Similarity Metrics, Crawler

Subwebs for specialized search

We describe a method to define and use subwebs, user-defined neighborhoods of the Internet. Subwebs help improve search performance by inducing a topic-specific page relevance bias over a collection of documents. Subwebs may be automatically identified using a simple algorithm we describe, and used to provide highly-relevant topic-specific information retrieval. Using subwebs in a Help and Supp...

The Mechanics of a Deep Net Metasearch Engine

The Deep Net refers to the thousands of topic-specific search engines on the Internet, including those that are inaccessible to traditional crawler-based search engines. Commercial metasearch engines have been slow to provide a simple, universal interface to these smaller topic-specific search engines. Turbo10 has developed a commercial metasearch engine that connects to these resources en mass...

Journal:
  • Acta Cybern.

Volume 18

Publication date: 2007